Engineers, Aware! Commercial Tools Disagree on Social Media Sentiment: Analyzing the Sentiment Bias of Four Major Tools

نویسندگان

چکیده

Large commercial sentiment analysis tools are often deployed in software engineering due to their ease of use. However, it is not known how accurate these are, and whether the ratings given by one tool agree with those another tool. We use two datasets - (1) NEWS consisting 5,880 news stories 60K comments from four social media platforms: Twitter, Instagram, YouTube, Facebook; (2) IMDB 7,500 positive negative movie reviews investigate agreement bias widely used (SA) tools: Microsoft Azure (MS), IBM Watson, Google Cloud, Amazon Web Services (AWS). find that assign same on less than half (48.1%) analyzed content. also AWS exhibits neutrality both datasets, bi-polarity dataset but dataset, MS exhibit no clear have dataset. Overall, has highest accuracy relative ground truth Findings indicate psycholinguistic features especially affect, tone, adjectives explain why disagree. Engineers urged caution when implementing SA for applications, as selection affects obtained labels.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Challenges of Evaluating Sentiment Analysis Tools on Social Media

This paper discusses the challenges in carrying out fair comparative evaluations of sentiment analysis systems. Firstly, these are due to differences in corpus annotation guidelines and sentiment class distribution. Secondly, different systems often make different assumptions about how to interpret certain statements, e.g. tweets with URLs. In order to study the impact of these on evaluation re...

متن کامل

Potential and Limitations of Commercial Sentiment Detection Tools

In this paper, we analyze the quality of several commercial tools for sentiment detection. All tools are tested on nearly 30,000 short texts from various sources, such as tweets, news, reviews etc. In addition to the quality analysis (measured by various metrics), we also investigate the effect of increasing text length on the performance. Finally, we show that combining all tools using machine...

متن کامل

Meta-Classifiers Easily Improve Commercial Sentiment Detection Tools

In this paper, we analyze the quality of several commercial tools for sentiment detection. All tools are tested on nearly 30,000 short texts from various sources, such as tweets, news, reviews etc. The best commercial tools have average accuracy of 60%. We then apply machine learning techniques (Random Forests) to combine all tools, and show that this results in a meta-classifier that improves ...

متن کامل

Benchmarking Twitter Sentiment Analysis Tools

Twitter has become one of the quintessential social media platforms for user-generated content. Researchers and industry practitioners are increasingly interested in Twitter sentiments. Consequently, an array of commercial and freely available Twitter sentiment analysis tools have emerged, though it remains unclear how well these tools really work. This study presents the findings of a detailed...

متن کامل

A Sentiment-Aware Approach to Community Formation in Social Media

Participating in a community exemplifies the aspect of sharing, networking and interacting in a social media system. There has been extensive work on characterising on-line communities by their contents and tags using topic modelling tools. However, the role of sentiment and mood has not been studied. Arguably, mood is an integral feature of a text, and becomes more significant in the context o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ACM on human-computer interaction

سال: 2022

ISSN: ['2573-0142']

DOI: https://doi.org/10.1145/3532203